Yifei Huang

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv

Towards Interactive Intelligence for Digital Humans

Dec 15, 2025
Via arXiv

The N-Body Problem: Parallel Execution from Single-Person Egocentric Video

Dec 12, 2025
Via arXiv

UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking

Dec 10, 2025
Via arXiv

Living the Novel: A System for Generating Self-Training Timeline-Aware Conversational Agents from Novels

Dec 08, 2025
Via arXiv

Can MLLMs Read the Room? A Multimodal Benchmark for Verifying Truthfulness in Multi-Party Social Interactions

Oct 31, 2025
Via arXiv

Solving the Hubbard model with Neural Quantum States

Jul 03, 2025
Via arXiv

Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision

Jun 06, 2025
Via arXiv

Egocentric Action-aware Inertial Localization in Point Clouds

May 20, 2025
Via arXiv

Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining

May 10, 2025
Via arXiv